The COST 249 SpeechDat Multilingual Reference Recogniser

نویسندگان

Finn Tore Johansen

Narada D. Warakagoda

Børge Lindberg

Gunnar Lehtinen

Zdravko Kacic

Andrej Zgank

Kjell Elenius

Giampiero Salvi

چکیده

The COST 249 SpeechDat reference recogniser is a fully automatic, language-independent training procedure for building a phonetic recogniser. It relies on the HTK toolkit and a SpeechDat(II) compatible database. The recogniser is designed to serve as a reference system in multilingual recognition research. This paper documents version 0.93 of the reference recogniser and presents results on smallvocabulary recognition for seven languages.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II)

An important aspect of noise robustness of automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. The present paper describes further improvements of an already existing reference recogniser towards achieving such kind of robustness. The reference recogniser applied is the COST 249 SpeechDat reference recogniser, which is a fully automatic, language-independent...

متن کامل

The basque speech_dat (II) database: a description and first test recognition results

In this work we present a telephone speech database for Basque, compliant with the guidelines of the Speechdat project. The database contains 1060 calls from the fixed telephone network. We first describe the main aspects of the database design. We also present the recognition results using the database and a set of procedures following the language independent reference recogniser commonly nam...

متن کامل

The Development and Integration of the LDA-Toolkit Into COST249 SpeechDat(II) SIG Reference Recognizer

This paper presents the development of Linear Discriminant Analysis toolkit (LDA-Toolkit) and its integration into widely used COST249 SpeechDat(II) Task Force Reference Recognizer (RefRec). The crucial parts of the LDA, the determination of LDA classes, as well as the influence of the level of dimensionality reduction on automatic speech recognition performance, are discussed. Evaluation of pr...

متن کامل

Phoneme-based recognition for the norwegian speechdat(II) database

This paper presents results from a number of exible vocabulary recognition experiments on the Norwegian SpeechDat(II) database. A common phoneme-based recogniser design procedure is tested on ve di erent tasks, and for ve di erent training sets. Results verify that reasonably accurate recognisers can be built with the database, using standard HMM techniques. They also quantify the importance of...

متن کامل

Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering

The paper describes our ongoing work on crosslingual speech recognition based on multilingual triphone hidden Markov models. Multilingual acoustic models were built using two different clustering procedures: agglomerative triphone clustering and tree-based triphone clustering. The agglomerative clustering procedure is based on measuring the similarity of triphones on a phoneme level where the m...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2000

The COST 249 SpeechDat Multilingual Reference Recogniser

نویسندگان

چکیده

منابع مشابه

A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II)

The basque speech_dat (II) database: a description and first test recognition results

The Development and Integration of the LDA-Toolkit Into COST249 SpeechDat(II) SIG Reference Recognizer

Phoneme-based recognition for the norwegian speechdat(II) database

Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering

عنوان ژورنال:

اشتراک گذاری